Prediction of prosodic phrase boundaries considering variable speaking rate

Authors

  • Yeon-Jun Kim
  • Yung-Hwan Oh
Abstract

This paper proposes a model for predicting the prosodic phrase boundaries of speech with variable speaking rates. Speakers can produce a sentence in several ways without altering its meaning or naturalness, i.e., a sequence of words can have a number of prosodic phrase boundaries. Many factors influence the variability of prosodic phrasing, such as syntactic structure, focus, speaker differences, speaking rate and the need to breathe. In this work, we adopt dependency grammar, similar to link grammar, to efficiently combine speaking rates. The proposed model reduced prosodic phrase boundary prediction error by 20% compared with the model using only syntactic information. We show a potential way to make use of a read speech corpus in the training of prosodic phrasing for spontaneous speech. The proposed model is expected to make synthesized speech more natural and to improve the robustness of spontaneous speech recognition.

Similar resources

A Hierarchical Stochastic Model for Automatic Prediction of Prosodic Boundary Location

Prosodic phrase structure provides important information for the understanding and naturalness of synthetic speech, and a good model of prosodic phrases has applications in both speech synthesis and speech understanding. This work describes a statistical model of an embedded hierarchy of prosodic phrase structure, motivated by results in linguistic theory. Each level of the hierarchy is modeled...

Variation in glottalization at prosodic boundaries in clear and plain lab speech

Previous research on glottalization shows that this voice quality occurs more frequently at prosodic boundaries than in the middle of prosodic phrases. This study investigates ten speakers’ use of glottalization at prosodic boundaries in five passages read in both clear and plain lab speech. We analyzed each syllable in every passage for its voice quality (glottalized or modal) and for its pros...

Automatic prosodic labeling of accent information for Japanese spoken sentences

This paper describes a method of automatic labeling of prosodic information focusing on accent types and accent phrase boundaries for Japanese spoken sentences. They are predicted by CRF (Conditional Random Fields) using linguistic information and F0 contour information. In the prediction of the accent type, we propose a method that uses a provisional accent type predicted by linguistic informa...

Phonological Phrase Boundaries Restrictions in Lexical Access by BP Adult Speakers

This study investigates the role of prosodic unit boundaries in on-line lexical access by Brazilian Portuguese adult speakers. Two types of prosodic constituents are considered: prosodic words (ω) and phonological phrases (ɸ). Motivated by French experimental results, we proposed two experiments in order to examine on-line lexical access in auditory sentences, considering that prosodic unit bou...

Effects of the Native Language on the Learning of Fundamental Frequency in Second-Language Speech Segmentation

This study investigates whether the learning of prosodic cues to word boundaries in speech segmentation is more difficult if the native and second/foreign languages (L1 and L2) have similar (though non-identical) prosodies than if they have markedly different prosodies (Prosodic-Learning Interference Hypothesis). It does so by comparing French, Korean, and English listeners' use of fundamental-...

Publication date: 1996